Distribution-Aware Online Classifiers
نویسندگان
چکیده
We propose a family of Passive-Aggressive Mahalanobis (PAM) algorithms, which are incremental (online) binary classifiers that consider the distribution of data. PAM is in fact a generalization of the Passive-Aggressive (PA) algorithms to handle data distributions that can be represented by a covariance matrix. The update equations for PAM are derived and theoretical error loss bounds computed. We benchmarked PAM against the original PA-I, PA-II, and Confidence Weighted (CW) learning. Although PAM somewhat resembles CW in its update equations, PA minimizes differences in the weights while CWminimizes differences in weight distributions. Results on 8 classification datasets, which include a real-life micro-blog sentiment classification task, show that PAM consistently outperformed its competitors, most notably CW. This shows that a simple approach like PAM is more practical in real-life classification tasks, compared to more sophisticated approaches like CW.
منابع مشابه
Online multiple people tracking-by-detection in crowded scenes
Multiple people detection and tracking is a challenging task in real-world crowded scenes. In this paper, we have presented an online multiple people tracking-by-detection approach with a single camera. We have detected objects with deformable part models and a visual background extractor. In the tracking phase we have used a combination of support vector machine (SVM) person-specific classifie...
متن کاملCoCoST: A Computational Cost Sensitive Classifier
Computational cost of classification is as important as accuracy in on-line classification systems. The computational cost is usually dominated by the cost of computing implicit features of the raw input data. Very few efforts have been made to design classifiers which perform effectively with limited computational power; instead, feature selection is usually employed as a pre-processing step t...
متن کاملIRIT @ TRECVid 2010 : Hidden Markov Models for Context-aware Late Fusion of Multiple Audio Classifiers
This notebook paper describes the four runs submitted by IRIT at TRECVid 2010 Semantic Indexing task. The four submitted runs can be described and compared as follows: • Run 4 – late fusion (weighted sum) of multiple audio-only classifiers output • Run 3 – context-aware re-rank of run 4 using hidden Markov model • Run 2 – context-aware late fusion of multiple audio classifiers output with hidde...
متن کاملSignature Verification using Integrated Classifiers
This paper presents a new approach for off-line signature verification. The proposed system is based on global, grid, ink distribution and texture features. The Boosting algorithm is applied to train and integrate multiple classifiers, and the distance-based classifier used as the base classifier corresponding to each feature set. Adaptive threshold is associated with individuality. Experimenta...
متن کاملAutomating the Quality Assurance of an On-line Knowledge-Based Classifier By Fusing Multiple Off-line Classifiers
We address two problems in the lifecycle of a production classifier: the monitoring of its decisions quality and the updating of the classifier over time. The proposed architecture consists of four off-line classifiers and an associative fusion module. The fusion is a T-norm based outer-product of the classifiers’ normalized outputs. By attaching a confidence measure to each output of the fusio...
متن کامل